Gene Ontology Evidence Sentence Retrieval Using Combinatorial Applications of Semantic Class and Rule Patterns

نویسندگان

  • Jian-Ming Chen
  • Yung-Chun Chang
  • Johnny Chi-Yang Wu
  • Po-Ting Lai
  • Hong-Jie Dai
چکیده

Gene Ontology (GO) provides helpful information with respect to biological process, molecular function and cellular component in annotating the relationships among gene, chemical and disease. Due to the complexity of GO knowledge, developing automated or semi-automated GO curation techniques remains to be a big challenge for database curators. In order to efficiently and precisely retrieve GO information from large amount of biomedical resources, we propose a GO evidence sentence retrieval system conducted via combinatorial applications of semantic class and rule patterns to automatically retrieve GO evidence sentences with specific gene mentions from full-length articles. Introduction Gene-oriented biomedical researches constitute the basis of advanced life science researches. Although the phenomenal growth of biomedical studies augmented our apprehension of complex biological mechanisms, the sharing and exchange of these results are hindered by the discrete terminologies and depictions. Therefore, the Gene Ontology (GO) initiative attempts to provide a universal representation of gene products and their correlated attributes. To promote research and tool development for the curation of GO database, BioCreative IV hosted a GO track, with an intention of retrieving GO evidence sentences for relevant genes (SubTask A) and predicting GO terms for relevant genes (SubTask B). In this work, we introduce a combinatorial approach toward the SubTask A of BioCreative IV. In our approach, the subtask is further divided into two subtasks: 1) candidate GO sentence retrieval, which selects the candidate GO sentences from a given full text, and 2) gene entity assignment, which assigns relevant gene mentions to a GO evidence sentence.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Executive Approach Based On the Production of Fuzzy Ontology Using the Semantic Web Rule Language Method (SWRL)

Today, the need to deal with ambiguous information in semantic web languages is increasing. Ontology is an important part of the W3C standards for the semantic web, used to define a conceptual standard vocabulary for the exchange of data between systems, the provision of reusable databases, and the facilitation of collaboration across multiple systems. However, classical ontology is not enough ...

متن کامل

Public Transport Ontology for Passenger Information Retrieval

Passenger information aims at improving the user-friendliness of public transport systems while influencing passenger route choices to satisfy transit user’s travel requirements. The integration of transit information from multiple agencies is a major challenge in implementation of multi-modal passenger information systems. The problem of information sharing is further compounded by the multi-l...

متن کامل

Using Tf-isf with Local Context to Generate an Owl Document Representation for Sentence Retrieval

In this paper we combine our previous research in the field of Semantic web, especially ontology learning and population with Sentence retrieval. To do this we developed a new approach to sentence retrieval modifying our previous TF-ISF method which uses local context information to take into account only document level information. This is quite a new approach to sentence retrieval, presented ...

متن کامل

Developing a BIM-based Spatial Ontology for Semantic Querying of 3D Property Information

With the growing dominance of complex and multi-level urban structures, current cadastral systems, which are often developed based on 2D representations, are not capable of providing unambiguous spatial information about urban properties. Therefore, the concept of 3D cadastre is proposed to support 3D digital representation of land and properties and facilitate the communication of legal owners...

متن کامل

BIOSSES: a semantic sentence similarity estimation system for the biomedical domain

Motivation The amount of information available in textual format is rapidly increasing in the biomedical domain. Therefore, natural language processing (NLP) applications are becoming increasingly important to facilitate the retrieval and analysis of these data. Computing the semantic similarity between sentences is an important component in many NLP tasks including text retrieval and summariza...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013